RESULT 5 

AL365193/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



FEATURES 

source 



ncrf frmx" 

Craniata; Vertebrata; Euteleostomi ; 
Catarrhini ; Hominidae ; Homo . 



repeat 



AL365193 51887 bp DNA linear PRI 22-NOV-2000 

Human DNA sequence from clone RP3-52 6N11 on chromosome 6, complete 
sequence . 
AL365193 

AL3 65193 .12 GI : 11322 849 
HTG. 
human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 51887) 
Tracey, A. 
Direct Submission 

Submitted (22 -NOV-2000) Sanger Centre, Hinxton, Cambridgeshire, 
CB10 ISA, UK. E-mail enquiries: humquery@sanger.ac.uk Clone 
requests : clonerequest@sanger . ac . uk 

On Nov 23, 2000 this sequence version replaced gi : 11190578. 
During sequence assembly data is compared from overlapping clones. 
Where differences are found these are annotated as variations 
together with a note of the overlapping clone name. Note that the 
variation annotation may not be found in the sequence submission 
corresponding to the overlapping clone, as we submit sequences with 
only a small overlap as described above. 

This sequence has been finished according to sequence map criteria 
as follows. An attempt is made to resolve all sequencing problems, 
such as compressions and repeats, but not necessarily within known 
annotated human repeat sequence elements (e.g. Alu) . Where the 
sequence is ambiguous, there is an annotation using the 'unsure' 
feature key. 

The following abbreviations are used to associate primary accession 
numbers given in the feature table with their source databases*. 
Em:, EMBL; Sw : , SWISSPROT; Tr : , TREMBL; Wp : , WORMPEP; Information 
on the WORMPEP database can be found at 

http : //www. Sanger . ac .uk/Proj ects/C_elegans/wormpep This sequence 
was generated from part of bacterial clone contigs of human 
chromosome 6 , constructed by the Sanger Centre Chromosome 6 Mapping 
Group. Further information can be found at 
http : / /www. sanger . ac . uk/HGP/Chr6 

RP3-526N11 is from the library RPCI-3 constructed at the Roswell 
Park Cancer Institute by the group of Pieter de Jong. For further 
details see http://bacpac.med.buffalo.edu/ 
VECTOR: pCYPAC2 

IMPORTANT: This sequence is not the entire insert of clone 
RP3-526N11 It may be shorter because we sequence overlapping 
sections only once, except for a 100 base overlap. 

The true left end of clone RP3-466P17 is at 51785 in this sequence. 
The true right end of clone RP1-28C20 is at 100 in this sequence. 

Location/Qualifiers 

1. .51887 

/organism= "Homo sapiens" 
/db_xref ="taxon: 9606" 
/ chromosome = " 6 " 
/clone="RP3-52 6Nll" 
* /clone_lib="RPCI-3" 
_region 2. .1199 





/note= 


"L1M4 repeat: matches 2626. 


.3805 of consensus" 


misc feature 


complement (1106 . . 1664) 






/ note = 


"match: GSS: Em: AQ506127 11 




repeat reqion 


23 90 . 


.2538 






/ note= 


"LI repeat: matches 4015. . 


4167 of consensus" 


repeat region 


3459 . 


.3774 






/ note = 


"AluYb8 repeat: matches l. 


.316 of consensus" 


repeat region 


3948 . 


.4156 






/note= 


"HAL1 repeat: matches 511. 


.747 of consensus" 


repeat region 


4168 . 


. 5043 






/not e= 


"L1PA14 repeat: matches 5266. .6149 of consensus" 


repeat region 


5077 . 


.5807 






/note= 


"L1PA14 repeat: matches 4535. .5263 of consensus" 


repeat region 


5804 . 


. 6152 






/ note= 


"HAL1 repeat: matches 170. 


.526 of consensus" 


repeat region 


7143 . 


. 7294 






/ note= 


"LTR41 repeat: matches 1. . 


175 of consensus" 


repeat region 


7866 . 


. 7927 






/ note= 


"31 copies 2 mer tt 69% conserved" 


repeat region 


8400 . 


. 8570 






/ note= 


"MER6 9B repeat: matches 1047. .1225 of consensus" 


repeat region 


8671 . 


. 9549 






/ note = 


"MER69B repeat: matches 1. 


. 914 of consensus " 


misc feature 


9600 . 


. 10107 






/ not e = 


"match: GSS: Em:AQ459097" 




repeat region 


9960 . 


. 10172 






/ not e= 


"L2 repeat: matches 2522. . 


2 74 9 of consensus " 


repeat region 


13463 . 


. 13792 






/note= 


"MER 9 3 repeat: matches 67. 


.3 97 of consensus" 


repeat region 


14604 . 


. 14914 






/note= 


"MER44A repeat: matches 51. 


.33 3 of consensus" 


repeat region 


16516 . 


. 16811 






/note= 


"AluSgl repeat: matches 1. 


.296 of consensus " 


repeat region 


19348 . 


.21583 






/ note= 


"TIGGERl repeat: matches 1. 


.2249 of consensus" 


repeat region 


21704 . 


. 22053 






/note= 


"THE IB repeat: matches 1. . 


3 64 of consensus" 


repeat region 


22079. 


.23644 






/ note= 


"THE1B- INTERNAL repeat: matches 5. .1580 of 




consensus " 




repeat recn on 


23645 . 


.23993 






/ note= 


"THE IB repeat: matches 1. . 


364 of consensus" 


repeat region 


24238 . 


.24531 






/note= 


"AluJb repeat : matches 1 . . 


2 90 of consensus" 


repeat region 


24731 . 


. 24990 






/ note= 


"L2 repeat: matches 2229. . 


2500 of consensus" 


repeat reqion 


24991. 


.25362 






/note= 


"THE1C repeat: matches 1. . 


371 of consensus" 


repeat__region 


25363 . 


.25552 






/ note= 


"L2 repeat: matches 2044. . 


222 9 of consensus " 


misc_f eature 


25534 , 


.25976 






/note= 


"match: GSS: Em: AQ786188 11 




repeat_region 


26695 . 


.27052 






/ note= 


"MER1B repeat: matches 1. . 


337 of consensus" 


repeat_region 


27392 . 


.28695 






/note= 


"L1MA9 repeat: matches 498C 


. .6308 of consensus" 


repeat_region 


28981. 


.29439 





repeat_region 

repeat_region 

mis cofeature 

repeat_region 

repeat_region 

misc__feature 

misc__f eature 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

repeat_region 

misc_f eature 

repeat_region 

repeat_region 

repeat_region 

repeat__region 

repeater eg ion 

mi sc_f eature 

repeat__region 

mi sc_f eature 

misc_f eature 

repeat_region 

misc__f eature 

misc_f eature 

repeat_region 

BASE COUNT 16111 
ORIGIN 

Query Match 



/note="MLTlC repeat: matches 1. .459 of consensus 1 ' 
29528. .29685 

/note="MIR repeat: matches 93. .262 of consensus" 
29702. ,30591 

/note="L2 repeat: matches 1798. .2716 of consensus" 
31958. .32309 

/note= "match: STS : Em:G24207" 
32486. .32705 

/note="MIR repeat: matches 10. .248 of consensus" 
33224. .33292 

/note="MIR repeat: matches 57. .12 6 of consensus" 

complement (34798. .353 62) 

/note= "match: GSS : Em:AQ627350" 

complement (34924 . . 35379) 

/note= "match: GSS: Em:AQ127321" 

36912. .37203 

/note= "AluSx repeat: matches 1. .2 90 of consensus" 
37323. .37548 

/notes"MIR repeat: matches 6. .252 of consensus" 
38453. .38482 

/note="15 copies 2 mer ca 100% conserved" 
40441. .40644 

/note="MIR repeat: matches 22. .235 of consensus" 
41172. .41635 

/notes"L2 repeat: matches 2264. .2709 of consensus" 
41939. .42389 

/note="match: GSS: Em : AQ126602 " 
42301. .42460 

/note="LlM4 repeat: matches 3751. .3918 of consensus" 
42800. .43080 

/note= "AluSx repeat: matches 1. .2 95 of consensus" 
43229. .43533 

/note= "AluSx repeat: matches 6. .3 09 of consensus" 
43546. .43712 

/note="MIR repeat: matches 71. .24 0 of consensus" 
45906. .46019 

/ note= "MER5B repeat: matches 66. .178 of consensus" 
complement (47 72 8. .48255) 
/note= "match: GSS: Em:AQ786765" 
47743 . .48126 

/ note= "MLT1B repeat: matches 1. .3 88 of consensus" 
48248. .48771 

/note= "match: GSS: Em:AQ680415" 
48263 . .48754 

/note="match: GSS : Em: AQ2054 91 " 
49005 . .49762 

/note="L2 repeat: matches 1890. .2742 of consensus" 
50377 . . 50863 

/notes "match: GSS: Em:AQ478674" 
complement (50429 . .50848) 
/note="match: GSS: Em:AQ525841" 
50541 . .50635 

/notes "MER5A repeat: matches 4. .96 of consensus" 
a 9944 c 9493 g 16339 t 



73.4%; Score 2297.4; DB 9; Length 51887; 



Best Local Similarity 99.4%; Pred. No. 0; 

Matches 23 90; Conservative 0; Mismatches 6; Indels 9; Gaps 8; 




RESULT 12 


AAT1 


8566 


ID 


AAT185 66 standard; cDNA ; 181 BP. 


XX 




AC 


AAT18566; 


XX 




DT 


05-NOV-1996 (first entry) 


XX 




DE 


Human chondrocyte RMA derived cDNA mol . TAU 10(1) . 


XX 




KW 


Human; chondrocyte; gene specific; primer; probe; isolation; 


KW 


int erleukin- lbe ta ; IL-lbeta; diagnosis; connective tissue disease; 


KW 


oseteoarthrit i s ; rheumatoid arthritis; ss. 


XX 




OS 


Homo sapiens. 


XX 




PN 


EP705842-A2 . 


XX 




PD 


10-APR-1996 . 


XX 




PF 


02-OCT-1995; 95EP-0115510 . 


XX 




PR 


0 6 -OCT- 1994 ; 94EP- 0115 751 . 


XX 




PA 


(FARH ) HOECHST AG. 


XX 




PI 


Bartnik E, Margerie D; 


XX 




DR 


WPI; 1996-181045/19. 


XX 




PT 


Diagnosis and treatment of IL-1 mediated connective tissue diseases 


PT 


- using osteopontin, calnexin, TSG-6 gene prod. , genes encoding them 


PT 


or antibodies to them 


XX 




PS 


Claim 10; Page 25; 31pp; English. 


XX 




CC 


The present sequence is 1 of 52 cDNA mols. derived from human 


CC 


chondrocyte RNA, which were searched against DNA databases for 


CC 


homology to known human genes; in this case no homology was found. 


CC 


The DNA mols. can be used for the prodn. of gene specific primers 


CC 


and probes to isolate genes induced by treating (esp. human) 


CC 


chondrocytes with interleukin- lbeta (IL-lbeta) , and for the 


CC 


diagnosis of IL-lbeta related connective tissue diseases, in 


CC 


partic. oseteoarthritis or rheumatoid arthritis. 


XX 




SQ 


Sequence 181 BP; 63 A; 2 8 C; 35 G; 55 T; 0 other; 



Query Match 5.1%; Score 158; DB 17; Length 181; 

Best Local Similarity 98.9%; Pred. No. 4.4e-25; 

Matches 180; Conservative 0; Mismatches 0; Indels 2; Gaps 2 
Qy 2 669 GGAGATGACATTTGCTTTGGGCAGAGGCAGCTAGCCAGGACACATTTCCACTATAATTTT 2 72 8 

'IMM Mill : I [ : 1 1 1 1 I ' 1 1 1 ! II 

Db 1 GGAGATGACATTTGCTTTGGGCAGAGGCAGCTAGCCAGGACACATTTCCACTATAATTTT 60 



Qy 



2 72 9 ACAAAGTTAAATTTATAAGCTAGCATTAAGTAAAGTGAAG - TCCAGCTCCCTTGCTAAAA 2 787 



IIMMIIMIMIIIIIIMIIIIIMIIMIIIIIIII IIMIIIIIIIIIMIIII 

Db 61 ACAAAGTTAAATTTATAAGCTAGCATTAAGTAAAGTGAAGTTCCAGCTCCCTTGCTAAAA 12 0 

Qy 2 7 8 8 ATAACTAGAGGTAATAATTGGTATTCAGGTAACTCATTTACAGTCATAATGTGTTGTGAA 2 84 7 

IIMIIIIIMIIIIIIIMIIIIIIIIMIIIMIIIIIII IMIIIIIMIIMIII 

Db 121 ATAACTAGAGGTAATAATTGGTATTCAGGTAACTCATTTACA - TCATAATGTGTTGTGAA 17 9 

Qy 2848 AA 2849 



Db 180 AA 181 



Query Match 5.4%; Score 158/ DB 17; Length 181; 

Best Local Similarity 98.9%; Pred . No. 3.4e-31; 

Matches 180; Conservative 0; Mismatches 0; Indels 2; Gaps 
rAGATGACATTTGCTTTGGGCAGAGGCAGCTAGCCAGGACACATTTCCACTATAATTTT 2 5^ 

IIMIMIIIIMMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

rAGATGACATTTGCTTTGGGCAGAGGCAGCTAGCCAGGACACATTTCCACTATAATTTT 6 0 
lAAAGTTAAATTTATAAGCTAGCATTAAGTAAAGTGAAG - TCCAGCTCCCTTGCTAAAA 2 5 c 

IIIIIIIIIIIIIIIMIIIIIIIIIIMIMIIMII MIIIIMIIMIIIIIII 

'AAAGTTAAATTTATAAGCTAGCATTAAGTAAAGTGAAGTTCCAGCTCCCTTGCTAAAA 12 C 
'AACTAGAGGTAATAATTGGTATTCAGGTAACTCATTTACAGTCATAATGTGTTGTGAA 2 6 c 

1 1 1 1 M 1 1 1 i 1 1 1 1 j 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 [ IIMIIIIIMIIIMI 





OAP1 
*S *± O _L 


Db 


1 


Qy 


2541 


Db 


61 


Qy 


2600 


Db 


121 


Qy 


2660 


Db 


180 



Query Match 14.9%; Score 13 6.6; DB 17; Length 181; 

Best Local Similarity 96.4%; Pred. No. 1.8e-22; 

Matches 161; Conservative 0; Mismatches 4; Indels 2; Gap 

OV 4 66 TTGACGAAGAGGCAGCTAGCCAGGACACATTTCCACTATAATTTTACAAAGTTAAATTTA 5 

|| | | | | | | | | I I I I I I I I I I I I I I I 1 1 1 1 M 1 1 1 1 1 M I I I I I I I I I I I i I i i 1 1 I 
Db 16 TTTGGGCAGAGGCAGCTAGCCAGGACACATTTCCACTATAATTTTACAAAGTTAAATTTA 7 

OV 526 TAAGCTAGCATTAAGTAAAGTGAAG - TCCAGCTCCCTTGCTAAAAATAACTAGAGGTAAT 5 

Ml MIIIIIIIIIIIIIM IIIIIIIMIIIMMMMMIIIIIIIIIIII 

Db 7 6 TAAGCTAGCATTAAGTAAAGTGAAGTTCCAGCTCCCTTGCTAAAAATAACTAGAGGTAAT 1 

OV 585 AATTGGTATTCAGGTAACTCATTTACAGTCATAATGTGTTGTGAAAA 631 

IIIIMIIIIIMIIIIIIIIMIIM 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 L 1 1 1 1 

Db 13 6 AATTGGTATTCAGGTAACTCATTTACA-TCATAATGTGTTGTGAAAA 181 



IDd- 



RESULT 1 
AF084535 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REMARK 
COMMENT 
FEATURES 

sourC' 




gene 
CDS 



AF08 4535 3128 bp mRNA linear PRI 01-OCT-1999 

Homo sapiens laforin (EPM2A) mRNA, complete cds . 
AF084535 

AF084535.2 GI:6005985 
Homo sapiens . 

Homo sapiens _ , ^ • . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostonu, 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 3128) 

«xiias: S iaii,B.A., Lee,J.R., Herbrick, J. A. , Huizenga, J. , Soder,S., 
MungallTA.J., Dunham,!. , Gardner, R., Fong, C . Y ., Carpenter , S . , 
Jardim,L., Satishchandra, P . , Andermann, E . , Snead,O.C. Ill , 
Lopes-Cendes,!., Tsui,L.C, Delgado-Escueta, A. V. , Rouleau, G. A. and 
Scherer S W . 

Mutations in a gene encoding a novel protein tyrosine phosphatase 

cause progressive myoclonus epilepsy 

Nat. Genet. 20 (2), 171-174 (1998) 

98442653 

9771710 

2 (bases 1 to 3128) 
Lee,J.R. and Scherer, S.W. 
Direct Submission 

Submitted (17-AUG-1998) Department of Genetics, The Hospital for 
Sick Children, 555 University Avenue, Toronto, ON MSG 1X8, Canada 

3 (bases 1 to 3128) 
Minassian,B. 

Direct Submission . ^_ n ^ 

Submitted (Ol-OCT-1999) Department of Genetics , The Hospit *\ ^ 
Sick Children, 555 University Avenue, Toronto, ON MSG 1X8, Canada 
Sequence update by submitter t 
On Oct 1, 1999 this sequence version replaced gi:3 97 B4bi. ; . 

Location/Qualifiers 
s 1. .3128 

/organism="Homo sapiens" 

/db_xref = M taxon: 9606" 

1. .3128 

/gene="EPM2A" 

1. .996 

/gene="EPM2A" 

/codon_start=l 

/product^" laforin" 

/protein_id="AAC83347 .2" 

/db xref="GI:6005986" 

/ 1 rans 1 at ion= " MRFRFGWVPPAVAGARPELLWGSRPELGRWEPRGAVRLRPAG 
TAAGDGALALQEPGLWLGEVELAAEEAAQDGAEPGRVDTFWYKFLKREPGGELSWEGN 
GPHHDRCCTYNENNLVDGVYCLPIGHWIEATGHTNEMKHTTDFYFNIAGHQAMHYSRI 
LPNIWLGSCPRQVEHVTIKLKHELGITAVMNFQTEWDIVQNSSGCNRYPEPMTPDTMI 
KLYREEGLAYIWMPTPDMSTEGRVQMLPQAVCLLHALLEKGHIVYVHCNAGVGRSTAA 
VCGWLQYVMGWNLRKVQYFLMAKRPAVYIDEEALARAQEDFFQKFGKVRSSVCSL" 



3 'UTR 

BASE COUNT 
ORIGIN 



997. .3128 
/gene="EPM2A" 
877 a 643 c 763 g 



845 t 




Query Match 99.9%; Score 3126.4; DB 9; Length 3128; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 312 7; Conservative 0; Mismatches 1; Indels 0; Gaps 



Qy 


l 


ATGCGCTTCCGCTTTGGGGTGGTGGTGCCACCCGCCGTGGCCGGCGCCCGGCCGGAGCTG 

iii i i i i i i i i i i r i i i i i i i l l l 1 1 1 1 1 1 1 1 t 1 1 t 1 1 1 1 1 


60 


Db 


l 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 i 1 1 1 1 M 1 1 

ATGCGCTTCCGCTTTGGGGTGGTGGTGCCACCCGCCGTGGCCGGCGCCCGGCCGGAGCTG 


60 


Qy 


61 


CTGGTGGTGGGGTCGCGGCCCGAGCTGGGGCGTTGGGAGCCGCGCGGTGCCGTCCGCCTG 

i i i i i i i i t 1 [ I I 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


120 


Db 


61 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 M II 1 1 1 1 1 1 M 1 M 1 

CTGGTGGTGGGGTCGCGGCCCGAGCTGGGGCGTTGGGAGCCGCGCGGTGCCGTCCGCCTG 


120 


Qy 


121 


AGGCCGGCCGGCACCGCGGCGGGCGACGGGGCCCTGGCGCTGCAGGAGCCGGGCCTGTGG 
. . , . , , , , , , , i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 


180 


Db 


121 


1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 

AGGCCGGCCGGCACCGCGGCGGGCGACGGGGCCCTGGCGCTGCAGGAGCCGGGCCTGTGG 


180 


Qy 


181 


CTCGGGGAGGTGGAGCTGGCGGCCGAGGAGGCGGCGCAGGACGGGGCGGAGCCGGGCCGC 

, , , i i i i i i t t i < i i i i i i i i i i i i i i l l l l l 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 


240 


Db 


181 


1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 M 1 1 1 M 1 M II 1 1 1 1 1 1 1 1 1 1 

CTCGGGGAGGTGGAGCTGGCGGCCGAGGAGGCGGCGCAGGACGGGGCGGAGCCGGGCCGC 


240 


Qy 


241 


GTGGACACGTTCTGGTACAAGTTCCTGAAGCGGGAGCCGGGAGGAGAGCTCTCCTGGGAA 
, , i , ■ i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i 

1 1 1 1 1 1 1 II 1 1 1 II 1 II 1 1 II 1 II 1 1 1 1 1 1 M II 1 1 1 II II 1 II 1 1 II 1 1 1 II 1 1 II 1 1 1 

GTGGACACGTTCTGGTACAAGTTCCTGAAGCGGGAGCCGGGAGGAGAGCTCTCCTGGGAA 


300 


Db 


241 


300 


Qy 


301 


GGCAATGGACCTCATCATGACCGTTGCTGTACTTACAATGAAAACAACTTGGTGGATGGT 

i i t i i t i i i i i i i i i i i i i iii i i I I 1 I t 1 1 1 1 l 1 1 1 I 1 1 1 ! 


360 


Db 


301 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 

GGCAATGGACCTCATCATGACCGTTGCTGTACTTACAATGAAAACAACTTGGTGGATGGT 


360 


Qy 


361 


GTGTATTGTCTCCCAATAGGACACTGGATTGAGGCCACTGGGCACACCAATGAAATGAAG 
, , . . , , i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i 

II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 

GTGTATTGTCTCCCAATAGGACACTGGATTGAGGCCACTGGGCACACCAATGAAATGAAG 


420 


Db 


361 


420 


Qy 


421 


CACACAACAGACTTCTATTTTAATATTGCAGGCCACCAAGCCATGCATTATTCAAGAATT 

,! I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l 

1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CACACAACAGACTTCTATTTTAATATTGCAGGCCACCAAGCCATGCATTATTCAAGAATT 


480 


Db 


421 


480 


Qy 


481 


CTACCAAATATCTGGCTGGGTAGCTGCCCTCGTCAGGTGGAACATGTTACCATCAAACTG 
,,, . i i i i > i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i 


540 


Db 


481 


| | || | | | | | || | II 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 
CTACCAAATATCTGGCTGGGTAGCTGCCCTCGTCAGGTGGAACATGTAACCATCAAACTG 


540 


Qy 


541 


AAGCATGAATTGGGGATTACAGCTGTAATGAATTTCCAGACTGAATGGGATATTGTACAG 
, , , , , , , i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 

1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AAGCATGAATTGGGGATTACAGCTGTAATGAATTTCCAGACTGAATGGGATATTGTACAG 


600 


Db 


541 


600 


Qy 


601 


AATTCCTCAGGCTGTAACCGCTACCCAGAGCCCATGACTCCAGACACTATGATTAAACTA 

MIIMIIIIIIIIMMIIIIIIIIMIMIIIIMIIIIIIIIIIIIIMIIIIIIII 

AATTCCTCAGGCTGTAACCGCTACCCAGAGCCCATGACTCCAGACACTATGATTAAACTA 


660 


Db 


601 


660 


Qy 


661 


TATAGGGAAGAAGGCTTGGCCTACATCTGGATGCCAACACCAGATATGAGCACCGAAGGC 


720 


Db 


661 


IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIMIIIM 

TATAGGGAAGAAGGCTTGGCCTACATCTGGATGCCAACACCAGATATGAGCACCGAAGGC 


720 


Qy 


721 


CGAGTACAGATGCTGCCCCAGGCGGTGTGCCTGCTGCATGCGCTGCTGGAGAAGGGACAC 


780 


Db 


721 


IIMIIIIIIIIMMIMMIIIIIIIIMMIIIMIIIMIIIIIIIIIMIMIII 

CGAGTACAGATGCTGCCCCAGGCGGTGTGCCTGCTGCATGCGCTGCTGGAGAAGGGACAC 


780 


Qy 


781 


ATCGTGTACGTGCACTGCAACGCTGGGGTGGGCCGCTCCACCGCGGCTGTCTGCGGCTGG 


840 




Db 


781 


I 1 1 1 1 1 I 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II M 1 M 

ATCGTGTACGTGCACTGCAACGCTGGGGTGGGCCGCTCCACCGCGGCTGTCTGCGGCTGG 


840 


Qy 


841 


CTCCAGTATGTGATGGGCTGGAATCTGAGGAAGGTGCAGTATTTCCTCATGGCCAAGAGG 

, . . , i i i i i i i i t i i i i i i i i i i ) i i t i i i i 


900 


Db 


841 


| | | | | | | | | | | | 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 II 
CTCCAGTATGTGATGGGCTGGAATCTGAGGAAGGTGCAGTATTTCCTCATGGCCAAGAGG 


900 


Qy 


901 


CCGGCTGTCTACATTGACGAAGAGGCCTTGGCCCGGGCACAAGAAGATTTTTTCCAGAAA 
i i i i i < i i i i i i i i i i i i i i i i i i i i i i i i i t i i i 


960 


Db 


901 


| | | | | | | | | | | | 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 
CCGGCTGTCTACATTGACGAAGAGGCCTTGGCCCGGGCACAAGAAGATTTTTTCCAGAAA 


960 


Qy 


961 


TTTGGGAAGGTTCGTTCTTCTGTGTGTAGCCTGTAGCTGGTCAGCCTGCTTCTGCCCCCT 

IMIMMIIIIIIIIIIIIIIIIIIIMMIIIIMIIIIIIIIMIIMMIIMIII 

TTTGGGAAGGTTCGTTCTTCTGTGTGTAGCCTGTAGCTGGTCAGCCTGCTTCTGCCCCCT 


1020 


Db 


961 


1020 


Qy 


1021 


CCTGATTTCCCTAAGGAGCCTGGGATGATGTTGGTCAAATGACCTAGAAACAAGGATTCT 

1 ! 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1! ! 1 1 i 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 ! 1 ! 1 1 1 1 1 1 1 1 

CCTGATTTCCCTAAGGAGCCTGGGATGATGTTGGTCAAATGACCTAGAAACAAGGATTCT 


1080 


Db 


1021 


1080 


Qy 


1081 


ACCTGAACTGAAAGGACTGTGTGACCTCCCCAAGCCAACCACTTTCACCTGGGATGACTT 

* ■ ■ t i i i i i i i i i i i i i i i i i i 1 1 1 1 1 1 1 \ 1 1 1 

IIMIIIIIIIIIIMIIMIMIIIIMMIIIIIMMIIIMIIIIIIIIMIIMI 

ACCTGAACTGAAAGGACTGTGTGACCTCCCCAAGCCAACCACTTTCACCTGGGATGACTT 


1140 


Db 


1081 


1140 


Qy 


1141 


TCGATTATGCTTTGGTTTGGGGCTGTATTTTTGAAATACTCTACAAGAAAGCTGTGGCTC 

| | | I I I I I I I 1 I 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TCGATTATGCTTTGGTTTGGGGCTGTATTTTTGAAATACTCTACAAGAAAGCTGTGGCTC 


1200 


Db 


1141 


1200 


Qy 


1201 


AACACATGAGAAGAAGCACGAAGCAGTTAGGCTGTACATCAGACAGAAGGGTAATGCGTG 

t i t t , i i * ■ i i r i i i i i 1 1 1 1 1 1 t 1 1 1 1 1 I L 1 

IIMIIIIIIIIIIIIIIIIIMMIIIMIIIIIMIMIIIIIIIIIIIMIMIIII 

AACACATGAGAAGAAGCACGAAGCAGTTAGGCTGTACATCAGACAGAAGGGTAATGCGTG 


1260 


Db 


1201 


1260 


Qy 


1261 


CAGTTCCTGCTGCCTGCAGGCAGACGAGGCCTTTGCTTTACAGCACTGTATGTGTTGCAC 

* i i i i i i i i * i i i i i i i i l t i r i 1 1 1 1 1 1 1 1 1 1 1 

IIIIIIIMIIIIIMIIIIIIMIIIIIIIIIIIIMIIIIMIIIIIIIMIIIIIII 

CAGTTCCTGCTGCCTGCAGGCAGACGAGGCCTTTGCTTTACAGCACTGTATGTGTTGCAC 


1320 


Db 


1261 


1320 


Qy 


1321 


GATGGATCCGTGACAGCACTTTCCTGTTGCACTGAAACTCTTGGCCATGTAGAGGAAAAG 

t i i i r 1 i 1 | 1 [ 1 1 1 1 1 | J 1 1 1 I I 1 1 t 1 | 

| || | | | | | || | | | | || 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GATGGATCCGTGACAGCACTTTCCTGTTGCACTGAAACTCTTGGCCATGTAGAGGAAAAG 


1380 


Db 


1321 


1380 


Qy 


1381 


ATATGGAGTTATGTGGATTTCATCACTAGTATGTGTGCCGTGAGCTGGTCAGTTGCCAAA 


1440 


Db 


1381 


IMIIMIIIMMIIIIMIIIIMIIIIIIIIIIIIIMIIIIMMIIMMIIIM 

ATATGGAGTTATGTGGATTTCATCACTAGTATGTGTGCCGTGAGCTGGTCAGTTGCCAAA 


1440 


Qy 


1441 


GGAGGAAATAAGGTTAGAAGCCTGAACCGTTACAAAAGAAGAGCTCACTATGGTCAAAAA 

MMIIIMMIIIIMIIIIIIMIMIIIIIIIIIIMIIIIIIIIIIIIIIMMII 

GGAGGAAATAAGGTTAGAAGCCTGAACCGTTACAAAAGAAGAGCTGACTATGGTCAAAAA 


1500 


Db 


1441 


1500 


Qy 


1501 


GTGATGGCTTTCAGGACTTGTTTTTTATCCTGCCTCACAGTTGTTAAAGTCTGTTCCAAG 

MIIIIIIIIIIMIIMMIIIIIIIIIIIIIIIMIIIIIMIIIMIIIIMIIMI 

GTGATGGCTTTCAGGACTTGTTTTTTATCCTGCCTCACAGTTGTTAAAGTCTGTTCCAAG 


1560 


Db 


1501 


1560 


Qy 


1561 


GCATCACCTTCCTTCTCTACCCAACAACCCTGTGTAACAACTAAAGTAGAATTATCTCTC 


1620 


Db 


1561 


MMIIIIIIIIMIIIIMIIIIMIIIMIIMIMMIIIIIMIIMIIIIIIMI 

GCATCACCTTCCTTCTCTACCCAACAACCCTGTGTAACAACTAAAGTAGAATTATCTCTC 


1620 


Qy 


1621 


ATTTGTTGGTGGTTTTTCCTCAAAATTACCAAACAAAGCAAAAAATACCCTTGTTTTTTA 


1680 



: 1 1 1 ; 1 1 1 M 1 1 , 1 1 1 1 1 ! M i ' : 1 1 1 : i 1 1 M ! 1 1 1 m 1 1 M M 1 1 , 1 1 1 . : 1 1 M : : : 



1621 ATTTGTTGGTGGTTTTTCCTCAAAATTACCAAACAAAGCAAAAAATACCCTTGTTTTTTA 1680 

1681 TAGTTGAGATGTCAAGGAAGTTAAATTGAGGCTTAATGAGCATAGGTAGCTTGTCCAAGG 1740 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I 
1681 TAGTTGAGATGTCAAGGAAGTTAAATTGAGGCTTAATGAGCATAGGTAGCTTGTCCAAGG 174 0 

1741 TCTCATGACCAGTCAAGGGCAAGCTGGAGTTAATAATCTATATTTATTTGACTCAGCACT 1800 

I I I I I I I I I I I | | | | I II I II I I I II II II I I I I I M M I I M II M I M 1 1 1 1 1 1 1 1 1 1 
1741 TCTCATGACCAGTGAAGGGCAAGCTGGAGTTAATAATCTATATTTATTTGACTCAGCACT 180 0 

1801 GTTTTCATCACAACTTGTTTTCCCAGCATCATGTAGTGCATTTAGTTTTGTCTTTCTCAG 1860 

I I I I I I I | | I I I M I I I I I I I I I I I I II I I I I I I I M I I I I M I I M I II I I I M I I I II 
1801 GTTTTCATCACAACTTGTTTTCCCAGCATCATGTAGTGCATTTAGTTTTGTCTTTCTCAG 186 0 

1861 GGTATAGTCAATATGCCTGCAGGAGTTTCTATAGCGAGACATAGAATAGTATTCTGATCA 192 0 

M II I M II II M 1 1 1 1 II 1 1 1 1 1 I M II I II 1 1 1 M 1 1 1 1 I 1 1 1 M I 1 1 II I I 1 1 1 1 1 1 

1861 GGTATAGTCAATATGCCTGCAGGAGTTTCTATAGCGAGACATAGAATAGTATTCTGATCA 192 0 

1921 GTTGCCAAAGAATCTAGGAAATTAGTTGTATTTTGTGCAAGCTAATTTAAAAACATGATG 1980 
i , I I I I I I I II I II I M I I I I I II I I I II I I I I I 1 I II 1 I 



I I I | | M M 1 I | I | M | M | | | M | I || 1 I I I I I II I II I I I M II I I I I I i i i 

1921 GTTGCCAAAGAATCTAGGAAATTAGTTGTATTTTGTGCAAGCTAATTTAAAAACATGATG 198 0 

1981 GGCTGTTTTAAGACCAGAGTGGAAATTCATGAGAGGAACTATACTACCAAAAGAGCCCAA 2 04 0 

I I I I I I I | | | || || I I I II I I I I I II I I I I I I I M I I I I I I M I I I I I M I M I I I I I I I 
1981 GGCTGTTTTAAGACCAGAGTGGAAATTCATGAGAGGAACTATACTACCAAAAGAGCCCAA 2 04 0 

2 041 ATGACCAAATCCATGGATAATTGCTTCAC AGCCTTGGCCATCCTGGCTCAGCTCTCAATT 210 0 

I I I I I | I I I I I I I | I | | | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
2041 ATGACCAAATCCATGGATAATTGCTTCACAGCCTTGGCCATCCTGGCTCAGCTCTCAATT 210 0 

2101 TAGTATAATATGCAGTTCCTGTGCCTCCAGACTATGCAGCTCATCACCCTAGGTTCTACA 2160 

I Mill I INI I Mill I MINIMUM Mill I INN I INN MIIIMIIII II 

2101 TAGTATAATATGCAGTTCCTGTGCCTCCAGACTATGCAGCTCATCACCCTAGGTTCTACA 2160 

2161 GGAAATACAGAGATGAACAACTTTGCCTTCAAAAAATGTGCTGCCTAGAAAACAGACCTG 2220 

| | I I II I I II I II II I I II I I I II II II I I I I II II II I I I I M I II II I II I II II I II 
2161 GGAAATACAGAGATGAACAACTTTGCCTTCAAAAAATGTGCTGCCTAGAAAACAGACCTG 2220 

2221 CATTTCAACCCAACTGTAATGCAGGATTTGGACCATGAATGATATGCTAGAATAGAAGAA 22 80 

I I I I I | I I I I | | | | | | M | | | | | | M I II M M I II M II M I I I II II M II M II II I 
2221 CATTTCAACCCAACTGTAATGCAGGATTTGGACCATGAATGATATGCTAGAATAGAAGAA 22 8 0 

22 81 AGAGAAGTGTTTTTTTAATTGAGAGCCTCTATGTGCAAGGTGATATATAATCATATCCAG 234 0 

I I | | II || II I II II II I II I II II I M I I M M M II M M I I M I M I II II I I M M 
2281 AGAGAAGTGTTTTTTTAATTGAGAGCCTCTATGTGCAAGGTGATATATAATCATATCCAG 234 0 

7 341 TTTAATCTTCACAATATCCAATGAAGAAGGTCTCATTATCTCCATGATAAAGATGGGGAA 24 0 0 

| | | | | | | | | | | | | | | | | M II I I M M I M I II II II I I M M M I I M I II II I M I II _ 
2341 TTTAATCTTCACAATATCCAATGAAGAAGGTCTCATTATCTCCATGATAAAGATGGGGAA 24 00 

2401 ACTAAGGTCAGAAGGGTTAACTCAACTGTCTATTGTCACATGATGAATAAATAGATGAAG 24 60 

I | | | || | || | | || | II II II I I M I M M II I II I I M I II II II II II II II M I II II 
24 01 ACTAAGGTCAGAAGGGTTAACTCAACTGTCTATTGTCACATGATGAATAAATAGATGAAG 2460 

2461 TGAGATACAAAGCTGGGTTTGATTCAAAGCCCTTACTTTCCTAATTAAACTATGATGCGT 252 0 

| | || | M || II II II II II M I M II I M II II II M M II II I I II II I M II II II II 
2461 TGAGATACAAAGCTGGGTTTGATTCAAAGCCCTTACTTTCCTAATTAAACTATGATGCGT 2520 



Qy 


2521 


Db 


2521 


Qy 


2581 


Db 


2581 


Qy 


2641 


Db 


2641 


Qy 


2701 


Db 


2701 


Qy 


2761 


Db 


2761 


Qy 


2821 


Db 


2821 


Qy 


2881 


Db 


2881 


Qy 


2941 


Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 



IIIIIIIIIIIIIIIIIIIMIIIIIIIII1IIIIIIIIIIIIIMIIIMIIIMIIII 

ATTTATTTTTCTGCACCTTCCTTTCTTCCACAAACACATATTGATAGATGCAAGAGACTC 
TTATTTATAAGGCGTGGGGGACAAGAAGGATACAAGGTAAGTTTCAGTGGAGCTCAGAGG 

IIIMIIIIIIIIIIIIIIIMIIIMIMIIIIIIIMIMIIIIIIIIMMIIIIII 

TTATTTATAAGGCGTGGGGGACAAGAAGGATACAAGGTAAGTTTCAGTGGAGCTCAGAGG 
ACGGGGAGATAGAACTGTGGCACTTAGGGGAGATGACATTTGCTTTGGGCAGAGGCAGCT 

MIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIMIIIIMMIIMIMMMIII 

ACGGGGAGATAGAACTGTGGCACTTAGGGGAGATGACATTTGCTTTGGGCAGAGGCAGCT 
AGCCAGGACACATTTCCACTATAATTTTACAAAGTTAAATTTATAAGCTAGCATTAAGTA 

MIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIMMIIMMIIIMIIIIMIMI 

AGCCAGGACACATTTCCACTATAATTTTACAAAGTTAAATTTATAAGCTAGCATTAAGTA 
AAGTGAAGTCCAGCTCCCTTGCTAAAAATAACTAGAGGTAATAATTGGTATTCAGGTAAC 

IIIIIIIIIIIIIIIIIMMIMIIIIIIIIIIIIIIMIIIIIIMIMIIMMIII 

AAGTGAAGTCCAGCTCCCTTGCTAAAAATAACTAGAGGTAATAATTGGTATTCAGGTAAC 
TCATTTACAGTCATAATGTGTTGTGAAAATTTAATCTTAAAAATTAAATTTTTAAACTAT 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIMMI 

TCATTTACAGTCATAATGTGTTGTGAAAATTTAATCTTAAAAATTAAATTTTTAAACTAT 
GTGGGTCTGTGAATTTCTTTAATGTCTAAGAAATCCAGCTTCATAATTTCCATGATACAA 

i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 i I ! 1 1 ! 1 1 1 1 ! i 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 

GTGGGTCTGTGAATTTCTTTAATGTCTAAGAAATCCAGCTTCATAATTTCCATGATACAA 
AGATCTTTTTTCAGGTGGATTTTTACCTTTGTTCCTTTTGCTCTGATAGACAAAATCAGT 

IIIIIIIIIIIIIIIIIIMIMIIIMMIIIIMIIIIIIIIIIIIIIIMIIIIIII 

AGATCTTTTTTCAGGTGGATTTTTACCTTTGTTCCTTTTGCTCTGATAGACAAAATCAGT 
TTAGGACTATTAAAGAATGTTTTGGAATAAACTGTCTTTTTCCTCAATGAATGGGATGTC 

1 1 1 1 1 1 1 ! 1 1 1 i 1 1 1 1 1 i E 1 1 1 1 1 1 1 1 1 i I ! 1 1 ! 1 1 1 ! 1 1 ! 1 1 1 E 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 

TTAGGACTATTAAAGAATGTTTTGGAATAAACTGTCTTTTTCCTCAATGAATGGGATGTC 
TAATGTATTTCAAAATCACCCAAAACTTTTGGCAAATAAAAGCATTTAAAAAGAAAAAAA 

IIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIM 

TAATGTATTTCAAAATCACCCAAAACTTTTGGCAAATAAAAGCATTTAAAAAGAAAAAAA 

AAAAAAAA 3128 
Illlllll 




1M- 



S<P< 



Result 10 

H^A130764 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
FEATURES 

source 



gene 
CDS 



BASE COUNT 
ORIGIN 



HSA130764 1037 bp mRNA linear PRI 16-DEC-1998 

Homo sapiens mRNA for LAFPTPase , isoform 2, partial. 

AJ130764 

AJ13 0764 .1 GI:3980310 
EPM2 gene; LAFPTPase. 
human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1037) 
Serratosa, J.M. 
Direct Submission 

Submitted (2 8-SEP-1998) Serratosa J.M., Servicio de neurologia, 
Furidacion Jimenez Diaz, Avda . Reyes Catolicos 2, 28040-MADRID, 
SPAIN 

2 (bases 1 to 1037) 

Serratosa, J.M. , Gomez -Garre , P . , Gallardo,E. , Anta,B. , 
Beltran-Valero de Bernabe,D., Lindhout,D., Augustin, P . B . , 
Tassinari,C. , Michelucci , R . , Malafosse, A. , Topcu,M., Grid,D., 
Dravet,C. , Berkovic,S. and Rodriguez de Cordoba, S. 

A novel protein tyrosine phosphatase gene is mutated in progressive 
myoclonus epilepsy of the Lafora type (EPM2) 
Hum. Mol . Genet. 8 (1999) In press 

Location/Qualifiers 

1. .1037 

/organism= "Homo sapiens" 
/db_xref ="taxon: 9606" 
/chromosome= " 6 " 
/map="6q23-q25" 
3. .713 

/gene= "EPM2 » ^ ^mj^J^^- 

<3. .713 
/gene="EPM2 " 
/note="isof orm 2" 
/ codon_start=l 
/product= "LAFPTPase " 
/protein_id="CAA10200 .1" 
/db_xref="GI : 3980311" 

/translations "DTFWYKFLKREPGGELSWEGNGPHHDRCCTYNENNLVDGVYCLP 
IGHWIEATGHTNEMKHTTDFYFNIAGHQAMHYSRILPNIWLGSCPRQVEHVTIKLKHE 
LGITAVMNFQTEWDIVQNSSGCNRYPEPMTPDTMIKLYREEGLAYIWMPTPDMSTEGR 
VQMLPQAVCLLHALLEKGHIVYVHCNAGVGRSTAAVCGWLQYVMGWNLRKVQYFLMAK 
RPAVYIDEEAASQDTFPL " 
296 a 215 c 241 g 285 t 



SiD3 - 99,0 % 

64 -73* -spy 

*7 vwL&A^ & d^' J 



Query Match 21.9%; Score 685; DB 9; Length 1037; 

Best Local Similarity 100.0%; Pred. No. le-111; 

Matches 685; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 2 42 TGGACACGTTCTGGTACAAGTTCCTGAAGCGGGAGCCGGGAGGAGAGCTCTCCTGGGAAG 3 01 

IIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIII 

Db 1 TGGACACGTTCTGGTACAAGTTCCTGAAGCGGGAGCCGGGAGGAGAGCTCTCCTGGGAAG 6 0 



Qy 



3 02 GCAATGGACCTCATCATGACCGTTGCTGTACTTACAATGAAAACAACTTGGTGGATGGTG 3 61 



Db 


61 


Qy 


362 


Db 


121 


Qy 


422 


Db 


181 


Qy 


482 


Db 


241 


Qy 


542 


Db 


301 


Qy 


602 


Db 


361 


Qy 


662 


Db 


421 


Qy 


722 


Db 


481 


Qy 


782 


Db 


541 


Qy 


842 


Db 


601 


Qy 


902 


Db 


661 



120 
421 
180 
481 
240 



I I I I I I I | | | | | | | | | | | | I I I I I I I M II II I II M M I I I M M I M M M M I II I I 

GCAATGGACCTCATCATGACCGTTGCTGTACTTACAATGAAAACAACTTGGTGGATGGTG 
TGTATTGTCTCCCAATAGGACACTGGATTGAGGCCACTGGGCACACCAATGAAATGAAGC 

IIIMIIIIMIIIMMIIIIIIIIIMIIIIIIMIIMIIMIIIMIIIIMim 

TCTATTGTCTCCCAATAGGACACTGGATTGAGGCCACTGGGCACACCAATGAAATGAAGC 
ACACAACAGACTTCTATTTTAATATTGCAGGCCACCAAGCCATGCATTATTCAAGAATTC 

1 1 1 1 1 1 1 1 1 : . . i ! , i i 1 1 ! . 1 : 1 1 1 1 1 1 1 1 : i 1 1 1 : 1 : 1 1 1 ' ' 1 1 1 1 ! 1 1 1 : 1 1 1 1 , 1 1 1 1 

A.CA.CAACAGACTTCTATTTTAATATTGCAGGCCACCAAGCCATGCATTATTCAAGAATTC 
TACCAAATATCTGGCTGGGTAGCTGCCCTCGTCAGGTGGAACATGTTACCATCAAACTGA 541 

i 1 1 , i , 1 1 1 ' 1 1 I : ; 1 1 1 1 ! 1 M 1 ! . 1 1 1 1 ! M M . 1 ' I 1 1 ! 1 1 1 , 1 1 1 II ' i 1 1 1 i ' ! 1 1 

TACCAAATATCTGGCTGGGTAGCTGCCCTCGTCAGGTGGAACATGTTACCATCAAACTGA 3 00 
AGCATGAATTGGGGATTACAGCTGTAATGAATTTCCAGACTGAATGGGATATTGTACAGA 6 0 1 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

AGCATGAATTGGGGATTACAGCTGTAATGAATTTCCAGACTGAATGGGATATTGTACAGA 3 6 0 

ATTCCTCAGGCTGTAACCGCTACCCAGAGCCCATGACTCCAGACACTATGATTAAACTAT 661 

| | M I I I I I II I I I I M I I I I I I I I I I I I I I I I M M I I I I I I I II I I I I I I I I I M I I I 
ATTCCTCAGGCTGTAACCGCTACCCAGAGCCCATGACTCCAGACACTATGATTAAACTAT 420 



I M | | | I II I I I I I I M I I I I I I I I I M II I I I I M I I I I I I I M 1 M I M I I I I M I I I 
ATAGGGAAGAAGGCTTGGCCTACATCTGGATGCCAACACCAGATATGAGCACCGAAGGCC 

GAGTACAGATGCTGCCCCAGGCGGTGTGCCTGCTGCATGCGCTGCTGGAGAAGGGACACA 

1 1 1 1 ; M M 1 1 1 ' 1 1 II 1 1 1 1 11 1 1 ! 1 1 M 1 M ! ,: M 1 1 1 1 II l M 1 1 1 M I II 1 1 1 M 

GAGTACAGATGCTGCCCCAGGCGGTGTGCCTGCTGCATGCGCTGCTGGAGAAGGGACACA 
TCGTGTACGTGCACTGCAACGCTGGGGTGGGCCGCTCCACCGCGGCTGTCTGCGGCTGGC 

I I > ; 1 1 1 m : 1 1 i 1 1 1 1 1 1 1 1 1 1 1 m 1 1 ; ' ! 1 1 : ; : 1 1 r 1 1 1 1 1 ; 1 1 m , 1 1 1 , 1 1 1 m 

TCGTGTACGTGCACTGCAACGCTGGGGTGGGCCGCTCCACCGCGGCTGTCTGCGGCTGGC 
TCCAGTATGTGATGGGCTGGAATCTGAGGAAGGTGCAGTATTTCCTCATGGCCAAGAGGC 

MMIIIIIIIIIIMIMIIIIMIIIMMIIMIIMMIIMIIMMIIIIIIII 

TCCAGTATGTGATGGGCTGGAATCTGAGGAAGGTGCAGTATTTCCTCATGGCCAAGAGGC 
CGGCTGTCTACATTGACGAAGAGGC 926 

1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 M i 1 1 1 1 1 1 1 



480 
781 
540 
841 
600 
901 
660 



^ ^3 „...___ 



Qu¥ry<Match 22.9%; Score 673.8; DB 9; Length 1037; 

Best Local Similarity 99.0%; Pred. No. 2.4e-150; 

Matches 678; Conservative 0; Mismatches 7; Indels 0; Gaps 

Ov 54 TGGACACGTTCTGGTACAAGTTCCTGAAGCGGGAGCCGGGAGGAGAGCTCTCCTGGGAAG 113 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 1 TGGACACGTTCTGGTACAAGTTCCTGAAGCGGGAGCCGGGAGGAGAGCTCTCCTGGGAAG 60 

Ov 114 GCAATGGACCTCATCATGACCGTTGCTGTACTTACAATGAAAACAACTTGGTGGATGGTG 173 
I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I M II I I I I I I I M I I I I I I I I I M I I II 

Db 6i gcaatggIcctcatcatgaccgttgctgtacttacaatgaaaacaacttggtggatggtg 120 

OV 174 TGTATTGTCTCCCAATAGGACACTGGATTGAGGCCACTGGACACACCAATGAAATGAAGC 233 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II I II M M 1 1 1 1 1 I M 1 1 1 1 M 1 1 1 1 1 II I II 

Db 121 TGTATTGTCTCCCAATAGGACACTGGATTGAGGCCACTGGGCACACCAATGAAATGAAGC 180 

Qy 234 ACACAACAGACTTCTATTTTAATATTGCAGGCCACCAAGCCATG 293 

I I 1 I I I I I I I I | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 181 ACACAACAGACTTCTATTTTAATATTGCAGGCCACCAAGCCATGCATTATTCAAGAATTC 240 

Ov 2 94 TACCAAATATCTGGCTGGGTAGCTGCCCTCGACAGGTGGAACATGTTACCATCAAACTGA 353 

I I II I I I I I II I II I II I I I I I I I 1 1 1 1 1 1 1 I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 241 TACCAAATATCTGGCTGGOT 300 

Qy 354 AGCATGAATTGGGGATTACAGCTGTCATGAATTTCCAGACTG 413 

I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 01 AGCATGAATTGGGGATTACAGCTGTAATGAATTTCCAGACTGAATGGGATATTGTACAGA 360 

QV 414 ATTCCTCATGCTGTAACCGCTACCCAGAGCCCATGACTCCAGACACTATGATTAAACTAT 473 

I I I I I I I I I I 1 I I I | I | | | | | || I I I I I I II I I II I I I I I M I I I I I I I I I I I M I I I I 
Db 361 ATTCCTCAGGCTGTAACCGCTACCCAGAGCCCATGACTCCAGACACTATGATTAAA 420 

OV 474 CTAGGGAAGAAGGCTTGGCCTACATCTGGATGCCAACACCAGATATGAGCACCGCAGGCC 533 

I I I I I I I | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 ATAGGGAAGAAGGCTTGGCCTACATCTGGATGCCAACACCAGATATGAGCACCGAAGGCC 48 0 

OV 534 GAGTACAGATGCTGCCCCAGGCGGTGTGCCTGCTGCATGCGCTGCTGGAGAAGGGACACA 5 93 

I I I 1 I I I I I I I I I I | II I II I I I I I I I I M II I I I I I I M I II I M I 1 1 1 1 1 1 1 1 1 1 1 1 1 
Db 481 GAGTACAGATGCTGCCCCAGGCGGTGTGCCTGCTGCATGCGCTGCTGGAGAAGGGACACA 54 0 

OV 594 TCGTGTACGTGCACTGCAACGCTGGGGTGGGCCGCTCCACCGCGGCTGTCTGCGGCTGGC 653 

I I I I I I I I I 1 I | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 TCGTGTACGTGCACTGCAACGCTGGGGTGGGCCGCTCCACCGCGGCTGTCTGCGGCTGGC 600 

ny 654 TCCAGTATGTGATGGGCTGGAATCTGAGGAAGGTGCAGTATTTCCTCATGGCCAAGAGGC 713 

I I I I I I | I I 1 I I I I | I I I I I I I I I I II I I I I I I M M 1 1 1 1 1 M I I I I I I I I I I I 1 1 M I 
Db 601 TCCAGTATGTGATGGGCTGGAATCTGAGGAAGGTGCAGTATTTCCTCATGGCCAAGAGGC 66 0 

Ov 714 CGGCTGTCTACATTGACGAAGAGGC 73 8 

IIIIIIIIIIIIIIIIIIIIMIII 
Db 661 CGGCTGTCTACATTGACGAAGAGGC 685 



It) 5 



/ 1 y / 



RESULT 1 
HSA130764 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
FEATURES 

source 



Grid,D. , 



gene 
CDS 



HSA130764 1037 bp mRNA linear PRI 16-DEC-1998 

Homo sapiens mRNA for LAFPTPase , isoform 2, partial. 
AJ130764 

AJ130764.1 GI:3980310 
EPM2 gene; LAFPTPase. 
human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1037) 
Serratosa, J.M. 

Direct Submission _ _ . 

Submitted (28-SEP-1998) Serratosa J.M., Servicio de neurologia, 
Fundacion Jimenez Diaz, Avda . Reyes Catolicos 2, 28040-MADRID, 
SPAIN 

2 (bases 1 to 1037) 

Serratosa, J.M. , Gomez-Garre , P . , Gallardo,E., Anta,B., 
Beltran-Valero de Bernabe^., Lindhout,D., August in, P. B . 
Tassinari,C. , Michelucci , R . , Malafosse,A. , Topcu,M. 
Dravet,C, Berkovic,S. and Rodriguez de Cordoba, S. 

A novel protein tyrosine phosphatase gene is mutated in progressive 
myoclonus epilepsy of the Lafora type (EPM2) 
Hum. Mol. Genet. 8 (1999) In press 

Location/Qualifiers 
5 1. .1037 

/organism="Homo sapiens" 

/ db_xr e f = " t axon -.9606" 

/chromosome^ " 6 " 

/map="6q2 3-q2 5 n 

3. .713 

/gene="EPM2" 

<3. .713 

/gene="EPM2 " 

/note=" isoform 2" 

/codon_start=l 

/product =" LAFPTPase" 

/protein_id= " CAA102 0 0.1" 

/db xref="GI : 3980311" 

/ trans la t ion= " DTFWYKFLKREPGGELSWEGNGPHHDRCCTYNENNLVDGVYCLP 
IGHWIEATGHTNEMKHTTDFYFNIAGHQAMHYSRILPNIWLGSCPRQVEHVTIKLKHE 
LGITAVMNFQTEWDIVQNSSGCNRYPEPMTPDTMIKLYREEGLAYIWMPTPDMSTEGR 
VQMLPQAVCLLHALLEKGHIVYVHCNAGVGRSTAAVCGWLQYVMGWNLRKVQYFLMAK 

RPAVYIDEEAASQDTFPL " 



BASE COUNT 
ORIGIN 



296 a 



215 C 



241 g 



285 t 



Query Match 87.7%; 

Best Local Similarity 99.1%; Pred. No. 1.3e-175; 
Matches 807; Conservative 0; Mismatches 7; 



Score 802.8; DB 9; Length 1037; 

0 ; Gaps 



Indels 



0; 



QY 
Db 

QY 



76 



1 7 GGATTATTCAAGAATTCTACCAAATATCTGGCTGGGTAGCTGCCCTCGACAGGTGGAACA 

I I II I M I Mi I i II I III 1 1 Mil II I IN IN III M Ml Mill Ml III IN II 

224 GCATTATTCAAG 283 
77 TGTTACCATCAAACTGAAGCATGAATTGGGGATTACAGCTGTCATGAATTTCCAGACTGA 136 



IIIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIMIIIIII IIIIIIIIIIMMIII 

4 TGTTACCATCAAACTGAAGCATGAATTGGGGATTACAGCTGTAATGAATTTCCAGACTGA 



Db 


284 


TGTTACCATCAAACTGAAGCATGAATTGGGGATTACAGCTGTAATGAATTTCCAGACTGA 


343 


Qy 


137 


ATGGGATATTGTTCAGAATTCCTCATGCTGTAACCGCTACCCAGAGCCCATGACTCCAGA 

MIIIMIIIII llllllllllll IIIIMIMIIIIIMIIIMIIIIMMIMM 

ATGGGATATTGTACAGAATTCCTCAGGCTGTAACCGCTACCCAGAGCCCATGACTCCAGA 


196 


Db 


344 


4 03 


Qy 


197 


CACTATGATTAAACTATCTAGGGAAGAAGGCTTGGCCTACATCTGGATGCCAACACCAGA 

Mlllllllllllllll 1 , 1 ' l 1 i : 1 ■ ■ 1 1 1 1 

CACTATGATTAAACTATATAGGGAAGAAGGCTTGGCCTACATCTGGATGCCAACACCAGA 


2 56 


Db 


404 


4 63 


Qy 


257 


TATGAGCACCGCAGGCCGAGTACAGATGCTGCCCCAGGCGGTGTGCCTGCTGCATGCGCT 


3 16 


Db 


464 


MM II II III 1 1 1 II 1 1 1 1 II 1 M 1 1 II 1 1 M M 1 1 M 1 1 II M 1 1 1 II 1 II 1 INN 

TATGAGCACCGAAGGCCGAGTACAGATGCTGCCCCAGGCGGTGTGCCTGCTGCATGCGCT 


523 


Qy 


317 


GCTGGAGAAGGGACACATCGTGTACGTGCACTGCAACGCTGGGGTGGGCCGCTCCACCGC 


3 76 


Db 


524 


M 1 M 1 1 1 M M 1 M II II 1 M 1 1 M 1 1 M II M 1 1 1 1 1 M 1 M 1 1 1 1 II 1 1 M 1 II 1 1 1 

GCTGGAGAAGGGACACATCGTGTACGTGCACTGCAACGCTGGGGTGGGCCGCTCCACCGC 


583 


Qy 


377 


GGCTGTCTGCGGCTGGCTCCAGTATGTGATGGGCTGGAATCTGAGGAAGGTGCAGTATTT 


436 


Db 


584 


II M 1 II 1 1 1 II 1 1 II 1 1 1 M M 1 1 1 1 1 II 1 II 1 1 M 1 M 1 1 1 1 1 M 1 Illllll 

GGCTGTCTGCGGCTGGCTCCAGTATGTGATGGGCTGGAATCTGAGGAAGGTGCAGTATTT 


643 


Qy 


437 


CCTCATGGCCAAGAGGCCGGCTGTCTACATTGACGAAGAGGCAGCTAGCCAGGACACATT 

1 M 1 II M 1 1 II M MM III MM III M II 1 1 II 1 1 M II 1 1 1 1 1 1 1 1 

CCTCATGGCCAAGAGGCCGGCTGTCTACATTGACGAAGAGGCAGCTAGCCAGGACACATT 


4 96 


Db 


644 


703 


Qy 


4 97 


TCCACTATAATTTTACAAAGTTAAATTTATAAGCTAGCATTAAGTAAAGTGAAGTCCAGC 


556 


Db 


704 


I 1 1 1 II II 1 1 II 1 1 1 M 1 1 1 II 1 II 1 II 1 1 1 1 1 II II M 1 1 1 II 1 1 1 1 1 II 1 1 II M M 1 

TCCACTATAATTTTACAAAGTTAAATTTATAAGCTAGCATTAAGTAAAGTGAAGTCCAGC 


763 


Qy 


557 


TCCCTTGCTAAAAATAACTAGAGGTAATAATTGGTATTCAGGTAACTCATTTACAGTCAT 


616 


Db 


764 


1 1 II II II II 1 1 II 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 II 1 1 M II 1 1 1 1 1 1 II 1 M 1 1 1 

TCCCTTGCTAAAAATAACTAGAGGTAATAATTGGTATTCAGGTAACTCATTTACAGTCAT 


82 3 


Qy 


617 


AATGTGTTGTGAAAATTTAATCTTAAAAATTAAATTTTTAAACTATGTGGGTCTGTGAAT 


£. 1 £Z 


Db 


824 


1 1 II 1 1 1 1 M 1 1 1 1 II II 1 1 II 1 1 1 1 1 1 II 1 1 II M M II 1 1 M M 1 1 Illllll 

AATGTGTTGTGAAAATTTAATCTTAAAAATTAAATTTTTAAACTATGTGGGTCTGTGAAT 


ft ft ^ 


Qy 


677 


TTCTTTAATGTCTAAGAAATCCAGCTTCATAATTTCCATGATACAAAGATCTTTTTTCAG 

1 1 1 1 1 1 M 1 M 1 1 1 II II M II II 1 II 1 1 M 1 1 1 1 1 1 1 II 1 M 1 1 II 1 II 1 1 1 II 1 II 1 1 

TTCTTTAATGTCTAAGAAATCCAGCTTCATAATTTCCATGATACAAAGATCTTTTTTCAG 


736 


Db 


884 


943 


Qy 


737 


GTGGATTTTTACCTTTGTTCCTTTTGCTCTGATAGACAAAATCAGTTTAGGACTATTAAA 

' ,||| " MM IM U, 1 "'• il 

GTGGATTTTTACCTTTGTTCCTTTTGCTCTGATAGACAAAATCAGTTTAGGACTATTAAA 


796 


Db 


944 


1003 


Qy 


797 


GAATGTTTTGGAATAAACTGTCTTTTTCCTCAAT 83 0 

IIIIIIIIIIIIIIIIIIIIIIIMIIIIIMM 

GAATGTTTTGGAATAAACTGTCTTTTTCCTCAAT 103 7 




Db 


1004 





